Ziyong Feng

Efficient, Validation-Free Intrinsic Quality Estimation for Large-Scale Face Recognition Datasets

May 28, 2026

LLaVA-OneVision-2: Towards Next-Generation Perceptual Intelligence

May 25, 2026

UniDoc-RL: Coarse-to-Fine Visual RAG with Hierarchical Actions and Dense Rewards

Apr 16, 2026

OneVision-Encoder: Codec-Aligned Sparsity as a Foundational Principle for Multimodal Intelligence

Feb 09, 2026

DanQing: An Up-to-Date Large-Scale Chinese Vision-Language Pre-training Dataset

Jan 15, 2026

ProCLIP: Progressive Vision-Language Alignment via LLM-based Embedder

Oct 22, 2025

UniVerse: Unleashing the Scene Prior of Video Diffusion Models for Robust Radiance Field Reconstruction

Oct 02, 2025

Gradient-Attention Guided Dual-Masking Synergetic Framework for Robust Text-based Person Retrieval

Sep 11, 2025

Region-based Cluster Discrimination for Visual Representation Learning

Jul 26, 2025

Breaking the Modality Barrier: Universal Embedding Learning with Multimodal LLMs

Apr 24, 2025